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(57) The invention concerns DNA encoding pro- 
lylendopeptidase, hybrid vectors containing such DNA, 
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dopeptidase comprising the steps of: culturing a host 
organism transformed with an expression vector com- 
prising a DNA coding for prolylendopeptidase and 
optionally, recovering the produced prolylendopepti- 
dase; and a process for the production of a C-terminal 
amidated peptide from two precursors thereof, compris- 
ing the steps of: placing the two precursors in contact 
with a prolylendopeptidase in a medium to convert the 
precursor peptides to the C-terminal amidated peptide, 
and recovering the resulting C-terminal amidated pep- 
tide. 
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Description 

[0001] The present invention relates to recombinant DNA coding for prolylendopeptidase and a process for the pro- 
duction of said DNA, a host transformed with the recombinant DNA and a process for the production of said transformed 
5 host, a process for the production of prolylendopeptidase using the transformed host, and use of the prolylendopepti- 
dase to produce a physiologically active N-terminally amidated peptide. 

BACKGROUND OF THE INVENTION 

10 [0002] Prolylendopeptidase was first found in human uterus by Walter et al., in 1971 as a specific endopeptidase 
which cleaves a peptide at the carboxyl- terminus side of a proline residue (Walter, R. et al. Science, 1971, 173, 827- 
829), and ever since the enzyme has been continuously studied with regard to its physiological role. 
[0003] On the other hand an endopeptidase showing a very similar specificity to mammalian prolylendopeptidase was 
found from a bacterium, Flavobacterium meningosepticum in 1978 (Yoshimoto, T, et al. Agric. Biol. Chem. 

is 1978,42,2417-2419). This finding enabled a larger amount to be prepared (but still at a lab scale) of the enzyme, and 
prolylendopeptidase became available for specific cleavage (Yoshimoto, T, et al. J. Biol. Chem. 1980, 255,4786-4792.) 
of proteins and peptides. Its unique specificity, recognizing proline residues, makes the enzyme quite useful as a basic 
tool of protein engineering and draws more attention to the study of the structure and function relationship. The prepa- 
ration of prolylendopeptidase from F. meningosepticum . however, has the following two crucial drawbacks arising from 

20 the bacterium. 1) The bacterium is pathogenic (Yoshimoto, T. et al., 1978, supra; Buchanan, R.E. et al. "Bergey's Man- 
ual of Determinative Bacteriology," 8th ed. 1974, The Williams & Wilkins Co., Baltimore.) and 2) it produces not only pro- 
lylendopeptidase but also significant amounts of other specific or non-specific peptidases (Yoshimoto, T. et al., 1978 
supra). These problems have prevented industrial production of the endopeptidase, in spite of a growing demand for 
the enzyme. 

25 [0004] Japanese Unexamined Patent Publication (KOKAI) No. H2-5880 describes cloning of post-proline peptidase 
gene derived from Bacteroides qingivalis, but this enzyme is clearly different from the present enzyme in that the former 
cleaves glycyl-proline-4-methoxy-p-naphtylamide which cannot be cleaved by the present prolylendopeptidase. 
[0005] D. Rennex et al., Biochemistry 30,2195-2203, 1991 describes a cloning of cDNA for prolylendopeptidase from 
the porcine brain, but does not describe an expression of the cDNA. 

30 [0006] It is believed that prolylendopeptidase is useful for modification of peptides, for example, C-terminal amidation 
of biologically active peptides such as LH-RH, oxytocin, calcitonins or the like, but for this purpose, it is necessary to 
obtain a large amount of the enzyme. Moreover, the enzyme preparation should be free of other peptidases, to ensure 
a desired reaction. Thus, the production of the enzyme by a gene recombination process is essential. 

35 SUMMARY OF THE INVENTION 

[0007] Accordingly, the present invention provides a process for the production of prolylendopeptidase by gene engi- 
neering. As a precondition of the recombinant process, the present invention also provides recombinant DNA mole- 
cules comprising a gene coding for prolylendopeptidase and a process for the production thereof, as well as a host 

40 transformed with the recombinant DNA molecule and a process for the production thereof. 

[0008] More particularly, the present invention provides a process for the production of a recombinant DNA molecule 
comprising a gene coding for prolylendopeptidase, comprising the steps of preparing cDNA or genomic DNA from cells, 
preferentially bacterial cells, capable of producing prolylendopeptidase, inserting DNA fragments coding for prolylen- 
dopeptidase into a cloning vector, and selecting a hybrid vector containing the DNA coding for prolylendopeptidase. 

45 [0009] The present invention also relates to the DNA molecules prepared as above. 

[001 0] The present invention also relates to an expression vector comprising a gene coding for prolylendopeptidase 
of any origin and expression control sequences operably linked with the gene. 

[001 1 ] The present invention further provides a host transformed with the gene coding for prolylendepeptidase of any 
origin, and capable of producing the prolylendopeptidase. 
so [001 2] The present invention still further provides a process for the production of prolylendopeptidase comprising the 
steps of: 

culturing a host organism transformed with an expression vector comprising a DNA coding for prolylendopeptidase 
to produce the prolylendopeptidase, and optionally, recovering the produced prolylendopeptidase. 

55 

[001 3] The present invention further provides a process for the production of a C-terminal amidated peptide from two 
precursors thereof, wherein one of the precursors is a precursor peptide forming N-terminal region of the C-terminal 
amidated peptide and having a proline residue at its C-terminus and another precursor is a precursor peptide or amino 
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acid forming a C-terminal portion of the C-terminal amidated peptide which precursor peptide or amino acid has been 
C-terminally amidated, comprising the steps of: 

placing the two precursors in contact with a prolylendopeptidase in a medium to convert the precursor peptides to 
5 the C-terminal amidated peptide, and recovering the resulting C-terminal amidated peptide. 

DETAILED DESCRIPTION OF THE INVENTION 

Preparation of gene coding for prolylendopeptidase 

10 

[001 4] Hereinafter, the term prolylendopeptidase is intended to include any prolylendopeptidase. The preferred mean- 
ings of the term, however, is a prolylendopeptidase derived from procaryotes, preferably from Flavobacterium spec, in 
particular from F. mennigosepticum . most preferentially from F. meningosepticum strain IFO 12535 (ATCC 13253). 
[001 5] Accordingly, a DNA of the present invention coding for a prolylendopeptidase may be cloned from any cell con- 
15 taining a gene coding for prolylendopeptidase, including cucaryotes such as mammals and procaryotes such as bacte- 
ria. In the case a mammalian source, for example the uterus of a mammal, e.g., of human, is chosen, the DNA coding 
for prolylendopeptidase is preferentially derived from the RNA via cDNA production. 

[001 6] A preferred source for DNA coding for prolylendopeptidase is a bacterium belonging to the genus Flavobacte- 
rium , in particular F. meningosepticum . most preferentially F. meningosepticum strain IFO 12535 (ATCC 13253). 

20 [001 7] Since DNA derived from a prokaryote such as bacterium, for example, F. meningosepticum . does not include 
introns, a genomic DNA can be used to clone a prolyl endopeptidase-encoding DNA if a prokaryotic cell is chosen as 
source. In this case, bacterial cells which produce prolylendopeptidase, for example, cells of F meningosepticum . is 
homogenised and a whole genomic DNA is extracted according to a conventional procedure (Saito, H. et al. Biochim. 
Biophys. Acta, 1963, 72, 619-629). The extracted DNA is then digested completely or partially with an appropriate 

25 restriction enzyme such as Bgl II, Eco Rl, Hinc II, Hind III, Pst I or Bam HI. The digestion product is then preferably sub- 
jected to preparative electrophoresis with low-melting-point agarose gel to enrich DNA fractions of a certain length. This 
is intended to enrich DNA fragments encoding prolylendopeptidase. Next, the DNA fragments are cloned into a suitable 
cloning vector. The cloning vector may be derived from any vector useful in the art of genetic engineering, such as from 
viruses, phages, cosmids, plasmids or chromosomal DNA, for example derivatives of SV40, Herpes-viruses, Papilloma 

30 viruses, Retroviruses, Baculovirus, phage X, e.g., NM989 or EMBL4, or phage M13, bacterial plasmids, e.g. pBR322, 
pUC18, pSF2124, pBR317orpPLMu., or yeast plasmids, e.g. yeast 2 juplasmid, or also chromosomal DNA comprising 
an origin of replication or an autonomously replicating sequence (ARS). Preferably, the cloning vector is a bacterial vec- 
tor such as pBR322, pUC18, pUC19 or the like. 

[0018] Alternatively, a cDNA library may be prepared from a cell expressing prolylendopeptidase, e.g. from a bacterial 
35 cell such as preferably from a bacterium belonging to the genus Flavobacterium . in particular F meningosepticum , most 
preferentially F. meningosepticum strain IFO 12535 (ATCC 13253), or from a eukaryotic cell or tissue, e.g. mammalian 
cell or tissue, which produces prolylendopeptidase. For example, RNA is extracted from the human uterus and enriched 
for mRNA according to a conventional procedure. Next, a cDNA library is constructed according to a conventional pro- 
cedure such as the Okayama-Berg method (Okayama, H. et al. Mol. Cell. Biol. 1982, 2, 161-170), the method of Gubler 
40 and Hoffman (Gubler, U. et al. Gene, 1983, 25, 263-270) or the like. 

[001 9] A variety of methods are known in the art for the incorporation of double-stranded cDNA or genomic DNA into 
an appropriate vector. For example, complementary homopolymer tracts may be added to the double-stranded DNA 
and the vector DNA by incubation in the presence of the corresponding deoxynucleoside triphosphates and an enzyme 
such as terminal deoxynucleotidyl transferase. The vector and double-stranded DNA are then joined by base pairing 
45 between the complementary homopolymeric tails and finally ligated by specific joining enzymes such as ligases. Other 
possibilities are the addition of synthetic linkers to the termini of the double-stranded DNA, or the incorporation of the 
double-stranded DNA into the vector by blunt- or staggered-end ligation. 

[0020] Screening of the genomic DNA library or cDNA library is preferably achieved using a DNA hybridization probe. 
Suitable DNA probes are DNAs of known nucleotide sequence consisting of at least 17 nucleotides, for example syn- 

50 thetic DNAs, cDNAs derived from mRNA coding for prolylendopeptidase, or genomic DNA fragments comprising e.g. 
adjacent DNA sequences which are isolated from a natural source or from a genetically engineered microorganism. 
[0021 ] To design synthetic DNA probes for screening the above-mentioned genomic DNA library or cDNA library, pro- 
lylendopeptidase for which a DNA coding region is to be cloned is purified, and its partial amino acid sequence is deter- 
mined according to a conventional procedure. Next, DNA sequences are designed on the basis of the partial amino acid 

55 sequence thus determined. Where an exact nucleotide sequence coding for the amino acid sequence is not known, a 
combination of nucleotide sequences which partially or totally cover possible nucleotide sequences present due to the 
degeneracy of genetic codon may be used. Alternatively, the third nucleotide in a codon may be replaced with inosine^ 
[0022] Synthetic DNA probes are synthesized according to known methods, for example by stepwise condensation 
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using the solid phase phosphotriester, phosphite triester or phosphoramidite method, e.g., the condensation of dinucle- 
otide coupling units by the phosphotriester method. These methods are adapted to the synthesis of mixtures of the 
desired oligonucleotides by using mixtures of two, three or four nucleotides dA, dC, dG and/or dT in protected form or 
the corresponding dinucleotide coupling units in the appropriate condensation step as described by Y IKe et al. (Nucleic 

5 Acids Research 1J_, 477, 1 983) . 

[0023] For hybridization, the DNA probes are labelled, e.g. radioactively labelled by the well known kinase reaction. 
The hybridization is performed according to known procedures, i.e., in buffer and salt solutions containing adjuncts, e.g. 
calcium chelators, viscosity regulating compounds, proteins, non-homologous DNA and the like, at temperatures favor- 
ing selective hybridization, e.g., between 0°C and 80°C, for example between 25°C and 50°C. 

10 [0024] In the prefered embodiment, the DNA library of R meningosepticum is used to transform an appropriate host 
such as E. coli cells, which are then plated and cultured on a solid medium to develop colonies, and positive clones are 
selected by a colony hybridization method using the above-mentioned DNA probes. The transformation of appropriate 
host cells with the DNA library and the selection and multiplication of transformed host cells are well known in the art. 
Examples of such methods are given below. 

15 [0025] The nucleotide sequence of DNA selected as described above can be determined by methods known per se : 
for example, by the Maxam-Gilbert method using end-labelled DNA or by the dideoxy chain termination method of 
Sanger. 

[0026] A nucleotide sequence of genomic DNA of Flavobacterium meningosepticum origin coding for prolylendopepti- 
dase and a corresponding amino acid sequence are shown in the Sequence Listing, SEQ ID No. 1 . 

20 [0027] Once a nucleotide sequence coding for, or an amino acid sequence of, prolylendopeptidase is determined, a 
DNA coding for the enzyme also can be prepared by an in vitro synthesis according to conventional methods. Suitable 
methods of the synthesis of DNA have been presented in summary form by S.A. Narang (Tetrahedron 39, 3, 1983). The 
known synthesis techniques allow the preparation of polynucleotides up to 120 bases in length, in good yield, high 
purity and in a relatively short time. Suitably protected nucleotides are linked with one another by the phosphodiester 

25 method (K.L. Agarwal et al., Angew. Chemie 84, 489, 1972), the more efficient phosphotriester method (C.B. Reese, 
Tetrahedron 34, 3143, 1978), the phophite triester method (R.L Letsinger etal., J. Am. Chem. Soc. 98, 3655, 1976) or 
phosphoramidite method (S.L Beaucage and M.H. Carruthers, Tetrahedron Letters 22, 1859,1981). 
[0028] Simplification of the synthesis of the oligonucleotides and polynucleotides is made possible by the solid phase 
method, in which the nucleotide chains are bound to a suitable polymer. H. Rink et al. (Nucl. Acids Reserach 12, 6369, 

30 1984) use trinucleotides instead of individual nucleotides and link them by the phosphotriester method in the solid 
phase synthesis. A polynucleotide can thus be prepared in a short time and with good yields. The actual double- 
stranded DNA is built up enzymatically from chemically prepared overlapping oligonucleotides from both DNA strands, 
which are held together in the correct arrangement by base-pairing and are then chemically linked by the enzyme DNA 
ligase. Another possibility comprises incubating overlapping single oligonucleotides from the two DNA strands in the 

35 presence of the four required deoxynucleoside triphosphates with a DNA polymerase, for example DNA polymerase I, 
the Klenow fragment of polymerase I or T4DNA polymerase, or with AMV (avian myeloblastosis virus) reverse tran- 
scriptase. The two oligonucleotides are thereby held together in the correct arrangement by base-pairing and are sup- 
plemented with the required nucleotides by the enzyme to give a complete double-stranded DNA (Scarpulla et al., Anal. 
Biochem. 121, 356, 1982). 

40 

Hybrid vector containing gene coding for prolylendopeptidase 

[0029] In the present invention, hybrid vectors include a hybrid vector for cloning or amplifying a desired prolylen- 
dopeptidase gene, and expression vectors. A hybrid vector of the invention comprises a DNA sequence coding for pro- 

45 lylendopeptidase defined hereinbefore. 

[0030] The hybrid vectors are derived from any vector useful in the art of genetic engineering, such as from viruses, 
phages, cosmids, plasmids or chromosomal DNA, for example derivatives of SV40, Herpes-viruses, Papilloma viruses, 
Retroviruses, Baculovirus, phage k, e.g. NM989 or EMBL4, or phage M13, bacterial plasmids, e.g. pBR322, pUC18, 
pSF21 24, pBR31 7 or pPLMu., or yeast plasmids, e.g. yeast 2\i plasmid, or also chromosomal DNA comprising an origin 

so of replication or an autonomously replicating sequence (ARS), or a defective virus, phage or plasmid in the presence of 
a helper virus, phage or plasmid allowing replication of said defective virus, phage or plasmid, e.g. M13(+)KS vector in 
presence of e.g. M13K07 helper phage. The Baculoviruses which can be used in the present inveniton are, for example, 
Autographa californica nuclear polyhedrosis virus (AcMNPV), Trichoplusia nj MNPV, Rachiplusia ou MNPV, Galleria 
mellonella MNPV, Bombyx mori nuclear polyhedrosis virus (BmNPV), and the like. A kit comprising a combination of an 

55 Autographa californica nuclear polyhedrosis virus and baculovirus transfer vectors pAc700, pAc701 , pAc702, pVL1392 
and pVL1 393 is commercially available from Invitrogen. 

[0031] A suitable vector of the invention is a vector which is operable in the microbial host cell chosen for multiplying 
the hybrid vector of for the expression of prolylendopeptidase. Suitable vectors contain a complete replicon and a 
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marker gene, which renders possible the selection and identification of the microorganisms transformed by the expres- 
sion plasmids by means of a phenotype feature. 

[0032] Thus, the hybrid vectors of the invention provide for replication of a desired prolylendopeptidase DNA in a suit- 
able host, either as an extrachromosomal element or by integration in the host chromosome. Several possible vector 

s systems are available for integration and expression of the cloned DNA of the invention. In principle, all vectors which 
replicate and/or comprise a recombinant gene which can be expressed in the chosen host are suitable. The vector is 
selected depending on the host cells envisaged for transformation. In general, such host cells may be prokaryotic or 
eukaryotic micro-organisms such as bacteria, fungi such as yeasts or filamentous fungi, or cells of higher eukaryotic 
origin such as animal, for example mammalian or insect, cells. Suitable host cells will be discussed in detail hereinbe- 

10 low. In principle, the hybrid vectors of the invention comprise a DNA encoding prolylendopeptidase, an origin of replica- 
tion or an autonomously replicating sequence, optionally dominant marker sequences, and, optionally, additional 
restriction sites. 

[0033] An origin of replication or an autonomously replicating sequence (a DNA element which confers autonomously 
replicating capabilities to extrachromosomal elements) is provided either by construction of the vector to include an exo- 
15 geneous origin such as derived from Simian virus (SV40) or another viral source, or by the host cell chromosomal 
mechanisms. 

[0034] A hybrid vector of the invention may contain selective markers depending on the host which is to be trans- 
formed, selected and cloned. Any marker gene can be used which facilitates the selection of transformants due to the 
phenotypic expression of the marker. Suitable markers are particularly genes from which a polypeptide can be 

20 expressed which provides resistance against compounds toxic to the receipt organism or which completes the enzyme 
system of a mutant lackig such an essential polypeptide, e.g. of an auxotrophic mutant. Suitable marker genes express, 
for example, antibiotic resistance, e.g. against tetracycline, ampicillin, or cycloheximide or provide for prototrophy in an 
auxotrophic mutant, for example in a yeast deficient in the ura3 . Ieu2 . his3 or trpL gene. It is also possible to employ as 
markers structural genes which are associated with an autonomously replicating segment providing that the host to be 

25 transformed is auxotrophic for the product expressed by the marker. 

[0035] Within the meaning of hybrid vectors of the invention are also hybrid expression vectors for the expression of 
prolylendopeptidase. They have in general the same features as the hybrid vectors described hereinbefore, and addi- 
tionally comprise expression control sequences allowing the production and, optionally, the secretion of prolylen- 
dopeptidase. Thus, hybrid expression vectors of the invention comprise a promoter region operably linked with a 

30 structural gene encoding prolylendopeptidase and, optionally, a DNA fragment encoding a leader or signal peptide, a 
transcriptional enhancer, a ribosomal binding site, a transcriptional terminator region and/or further regulatory 
sequences. 

[0036] A wide variety of promoter sequences may be employed, depending on the nature of the host cell. Promoters 
that are strong and at the same time well regulated are the most useful. Sequences for the initiation of translation are 
35 for example Shine-Dalgarno sequences. Sequences necessary for the initiation and termination of transcription and for 
stabilizing the mRNA are commonly available from the noncoding 5'-regions and 3'-regions, respectively, of viral or 
eukaryotic cDNAs, e.g from the expression host. 

[0037] Examples of suitable promoters are XP L , XP R , or XN, E. coli lac, trp, tac, or Ipp, yeast TRP1 -, ADHI-, ADHII-, 
PH03-, PHOS-, or glycolytic promoters such as the promoter of the enolase, glyceraldehyde-3-phosphate dehydroge- 

40 nase, 3-phosphoglycerate kinase (PGK), hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phos- 
phate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase and glucokinase genes, or promoters derived from eukaryotic viruses, e.g. SV40, Rous sarcoma virus, ade- 
novirus 2, bovine papilloma virus, papovavirus, cytomegalovirus or Baculovirus, e.g. Autoqrapha californica nuclear pol- 
yhedrosis virus (AcMNPV), Trichoplusia ni MNPV, Rachiplusia ou MNPV, Galleria mellonella MNPV, derived promoters 

45 or mammalian cell derived promoters, e.g. of the actin, collagen, myosin, or p-globin gene. A preferred eukaryotic pro- 
moter is a polyhedrin gene promoter of a Baculovirus, preferentially of the Autographa californica nuclear polyhedrosis 
virus (AcMNPV). The eukaryotic promoters may be combined with enhancing sequences such as the yeast upstream 
activating sequences (UAS) or viral or cellular enhancers such as the cytomegalovirus IE enhancers, SV40 enhancer, 
immuno-globulin gene enhancer or others. 

so [0038] Enhancers useful for the expression are transcription-stimulating DNA sequences, e.g. derived from viruses 
such as Simian virus, Cytomegalovirus, polyoma virus, bovine papilloma virus or Moloney sarcoma virus, or of genomic 
origin. An enhancer sequence may also be derived from the extrachromosomal ribosomal DNA of Physarum polyceph- 
alum (PCT WO 86/00089), or it may be the upstream activation site from the acid phosphatase PH05 gene (EP-B-0 213 
593), or the PH05, trp, PH05-GAPDH hybrid (EP-B-0 213 593), or the like promoter. 

55 [0039] Signal sequences which can be used for the present inveniton may be, for example, a presequence or secre- 
tory leader directing the secretion of the polypeptide, or the like. Signal sequences which can be used in the present 
invention are known in the literature, e.g. compiled in von Heijne, G., Nucleic Acids Res. 14, 4683 (1986). Another suit- 
able signal sequence extends from amino acid 1 to 19 of the amino acid sequence depicted in the sequence listing 
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under SEQ ID No. 1. This signal sequence alone as well as a DNA molecule encoding same, preferably the DNA mol- 
ecule represented by nucleotides 260 to 316 of SEQ ID No. 1 is also covered by the present invention. 
[0040] A ribosomal binding site (Shine-Dalgamo Sequence) is either naturally linked to the promoter used or may be 
located on a short nucleotide sequence which may be covalently linked to the 5' end of the coding region for prolylen- 

5 dopeptidase. Ribosomal binding sites are known in the art. 

[0041] A promoter chosen for the construction of a hybrid expression vector of the invention may be regulated by a 
regulatory protein and the production of prolylendopeptidase in the transformed host cell then may be inducible or dere- 
pressible. The gene for the regulatory protein may be located either in the genome of the host strain, on an additional 
plasmid vector the host strain may be cotransformed with, or on the hybrid vector of the invention. The selection of a 

10 suitable gene for a regulatory protein depends on the promoter used. The conditions for the induction or derepression 
of the production of prolylendopeptidase also depend on the promoter and on the regulatory protein. A regulatory pro- 
tein which can be used in the present invention is, for example, a repressor protein, e.g. a product of the trpR, lacl, A,cro, 
or Xc\ gene, or a temperature sensitive mutant thereof. 

[0042] Preferred hybrid expression vectors of the invention are expression vectors suitable for the expression of 
15 mature prolylendopeptidase represented by the amino acid sequence shown in SEQ ID No. 1 in E. coli , more preferably 
such expression vectors comprising a signal sequence, preferably the signal sequence of the prolylendopeptidase gen 
shown under SEQ ID No. 1, operatively linked with the gene encoding the mature prolylendopeptidase. 
[0043] Most preferred expression vectors are plasmids pFPH5-KD50, pUK-FPEP-a and pUK-FPEP-b characterized 
in the accompanying examples. 

20 

Transformed hosts and preparation thereof 

[0044] The invention concerns a transformed host cell for multiplicating a recombinant DNA molecules of the invention 
or particularly for expressing a prolylendopeptidase structural gene comprised in a recombinant DNA molecule of the 
25 invention. 

[0045] The transformed microbial host strains are cultured in a liquid medium containing sources of carbon and nitro- 
gen which can be assimilated by the microbial cell, and inorganic salts, applying methods known in the art. The culture 
of the hosts is carried out in a conventional nutrient medium which may be supplemented with or deprived of chemical 
compounds allowing negative or positive selection of the transformants, i.e. such hosts containing the desired DNA mol- 

30 ecule together with a selection marker, from the non -transformants, i.e. such hosts lacking the desired DNA molecule. 
[0046] Any transformable hosts useful in the art may be used, e.g. bacteria, such as E. coli . fungi, such as Saccha- 
romyces cerevisiae . Kluyveromyces lactis , or filamentous fungi, such as Aspergillus Spec , e.g. A. nidulans , A. oryzae . 
A^ carbonari us . A. awamori or A. niger . However, the use of suitable hosts which are devoid of or poor in restriction 
enzymes or modification enzymes may be advantageous. Examples of such hosts are bacteria, e.g. Bacillus subtilis . 

35 Bacillus stearothermophilus. Pseudomonas . Haemophilus. Streptococcus and others, and yeasts, for example Saccha- 
romyces cerevisiae. and in particular strains of Escherichia coli . for example E. coli X1776, E. coli Y1090, E. coli 
W3110, E. coli HB101/LM1035, Bcoli JA 221. E. coli DH5a, or preferentially E i _co[i DH5aF', JM109, MH1 or HB101, 
or E. coli K12 strain. Further suitable hosts are cells of higher organisms, in particular established continuous human 
or animal cell lines, e.g. human embryonic lung fibroblasts L132, human malignant melanoma Bowes cells, HeLa cells, 

40 SV40 virus transformed kidney cells of African green monkey COS-7 or Chinese hamster ovary (CHO) cells. Other suit- 
able host cells are established insect cell lines, for example, Spodoptera fruoiperda . such as Sf21 or preferentially Sf9 
(ATCC CRL1711), Mamestra brassicae , Bombyx mori cell systems using Bombyx mori nuclear polyhedrosis virus 
(BmNPV) and the like. 

[0047] The invention concerns also a method for the preparation of such transformed hosts comprising treatment of 
45 a suitable host cell under transforming conditions with a recombinant DNA molecule of the present invention, especially 
a hybrid vector of the invention, optionally together with a selection marker gene and optionally selecting the transform- 
ants. 

[0048] Transformation of microorganisms is carried out according to conventional methods as described in the litera- 
ture, for example for S. cerevisiae (A. Hinnen et al., Proc. Natl. Acad Sci USA, 75, 1929, 1978), for B. subtilis (Anag- 

50 nostopoulos et al., J. Bacteriol. 81, 741, 1961), and for E. coli fM. Mandel et al., J. Mol. Biol 53, 159, 1970). 

[0049] Accordingly, the transformation procedure of E. coli cells includes, for example, Ca 2+ pretreatment of the cells 
so as to allow DNA uptake, and incubation with the hybrid vector. The subsequent selection of the transformed cells can 
be achieved, for example, by transferring the cells to a selective growth medium which allows separation of the trans- 
formed cells from the parent cells dependent on the nature of the marker sequence of the vector DNA. Preferably, a 

55 growth medium is used which does not allow growth of cells which do not contain the vector. The transformation of yeast 
comprises, for example, steps of enzymatic removal of the yeast cell wall by means of glucosidases, treatment of the 
obtained spheroplasts with the vector in the presence of polyethylene glycol and Ca 2+ ions, and regeneration of the cell 
wall by embedding the spheroplasts into agar. Preferably, the regeneration agar is prepared in a way to allow regener- 
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ation and selection of the transformed cells as described above at the same time. 

[0050] Transformation of cells of higher eukaryotic origin, such as mammalian cell lines, is preferably achieved by 
transfection. Transfection is carried out by conventional techniques, such as calcium phosphate precipitation, microin- 
jection, protoplast fusion, electroporation, i.e. introduction of DNAby a short electrical pulse which transiently increases 

s the permeability of the cell membrane, or in the presence of helper compounds such as diethylaminoethyldextran, 
dimethyl sulfoxide, glycerol or polyethylene glycol, and the like. After the transfection procedure, transfected cells are 
identified and selected e.g. by cultivation in a selective medium chosen depending on the nature of the selection 
marker, for example standard culture media such as Dulbecco's modified Eagle medium (DM EM), minimum essential 
medium, RPMI 1640 medium and the like, containing e.g. the corresponding antibiotic. 

10 [0051] The transformed host cells are cultured by methods, known in the art in a liquid medium containing assimilable 
sources of carbon, e.g. carbohydrates such as glucose or lactose, nitrogen, e.g. amino acids, peptides, proteins or their 
degradation products such as peptones, ammonium salts or the like, and inorganic salts, e.g. sulfates, phosphates 
and/or carbonates of sodium, potassium, magnesium and calcium. The medium furthermore contains, for example, 
growth-promoting substances, such as trace elements, for example iron, zinc, manganese and the like. 

15 [0052] The medium is preferably so chosen as to exert a selection pressure and prevent the growth of cells which 
have not been transformed or have lost the hybrid vector. Thus, for example, an antibiotic is added to the medium if the 
hybrid vector contains an antibiotic resistance gene as marker. If, for instance, a host cell is used which is auxotrophic 
in an essential amino acid whereas the hybrid vector contains a gene coding for an enzyme which complements the 
host defect, a minimal medium deficient of the said amino acid is used to culture the transformed cells. 

20 [0053] Cells of higher eukaryotic origin such as mammalian cells are grown under tissue culture conditions using com- 
mercially available media, for example Dulbecco's modified Eagle medium (DMEM), minimum essential medium, RPMI 
1640 medium and the like as mentioned above, optionally supplemented with growth -promoting substances and/or 
mammalian sera. Techniques for cell cultivation under tissue culture condition are well known in the art and include 
homogeneous suspension culture, e.g. in an airlift reactor or in a continuous stirrer reactor, or immobilized or entrapped 

25 cell culture, e.g. in hollow fibers, microcapsules, on agarose microbeads, porous glass beads, ceramic cartridges, or 
other microcarriers. 

[0054] Culturing is effected by processes which are known in the art. The culture conditions, such as temperature, pH 
value of the medium and fermentation time, are chosen so that a maximum expression level of the polypeptide or deriv- 
ative of the invention is obtained. Thus, an E. coli or yeast strain is preferably cultured under aerobic conditions by sub- 
30 merged culture with shaking or stirring at a temperature of about 20°C to 40°C, preferably at about 30°C, and a pH value 
of 4 to 8, preferably of about 7, for about 4 to 30 hours, preferably until maximum yields of the polypeptide or derivative 
of the invention are reached. 

Production of prolylendopeptidase 

35 

[0055] The present invention concerns also a method for the production of prolylendopeptidase. 
[0056] For the expression of prolylendopeptidase, either procaryotic or eucaryotic host cells may be used, e.g. E. coli 
strains defective in protease genes, e.g. in the ton protease gene, and genes involved in the regulation of heat shock 
induced protein synthesis, e.g. in the htpR gene (US Patent 4,758,512; Buell, G. et al., Nucleic Acids Res. 13: 1923- 
40 1938, 1985). 

[0057] Preferably, prolylendopeptidase is produced using E. coli . In this case to improve the expression, a 5'-terminal 
non-coding region of the cloned DNA is preferably removed while maintaining the full length of the coding region, par- 
ticularly the coding region of the mature polypeptide shown under SEQ ID No. 1. The coding region is most preferably 
functionally linked with a signal sequence allowing the secretion of the prolylendopeptidase. Moreover, the structural 
45 gene is functionally linked with a promoter region functional in E, coli . either heterologous to or natively linked with the 
prolylendopeptidase coding region. The linkage is performed according to a conventional procedure, for example, using 
an appropriate restriction enzyme site or deletion by digesting with an exonuclease such as E coli exonuclease III and 
successive blunting with a nuclease, e.g. mung-bean nuclease. 

[0058] In one of the most preferable embodiments, a genomic DNA having a linker sequence immediately upstream 
so of a full length coding region for prolylendopeptidase is linked with a heterologous promoter such as tec promoter in an 
expression vector, for example, a plasmid based on pUC1 19 plasmid. Very particularly, a coding region in the genomic 
DNA from Flavobacterium meningosepticum encodes a pro-form of prolylendopeptidase, e.g. such consisting of a 
mature form of the enzyme, for example consisting of amino acid residue 1 to the end of the sequence shown under 
SEQ ID No. 1 , and a signal peptide, for example amino acid residues -19 to -1 of the sequence shown under SEQ ID 
55 No. 1 . When such a type of an expression plasmid is used to transform E. coli host, and the transformant is cultured, 
then prolylendopeptidase is produced in E. coli cells and secreted into the periplasmic region. In the process of secre- 
tion the signal peptide is removed to give a mature form of the enzyme which is not incorporated in inclusion bodies, 
and therefore, the produced prolylendopeptidase is easily recovered. 
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[0059] According to another embodiment of the present invention, a DNA coding for the present enzyme is inserted 
into a baculovirus transfer vector to construct a recombinant baculovirus transfer vector, and the recombinant baculovi- 
rus transfer vector is then co-transfected with a baculovirus DNA to insect cells to carry out a homologous recombina- 
tion. 

5 [0060] The baculovirus transfer vector is usually a plasmid containing a segment of baculovirus DNA, which segment 
comprises a gene not essential for the replication of baculovirus. The gene not essential for the replication of baculovi- 
rus is, for example, a polyhedrin gene comprising a polyhedrin structure gene and a promoter thereof. Such baculovirs 
transfer vectors are, for example, pAcYMI (Matsuura, Y, et aL J. Gen. Virol. (1987) 68, 1233-1250), pAc31 1 , pAc360, 
pAc373, pAc380(USP4,745,051), pAc700, pAc701 , pAc702, pVL1392, pVL1393, and the like. Preferred is the use of 

10 pVL1392. 

[0061] The baculoviruses used in the present invention are, for example, Trichoplusia ni MNPV, Rachiplusia ou MNPV, 
Galleria mellonella MNPV, and the like. Preferentially used is Autoarapha californica nuclear polyhedrosis virus (AcM- 
NPV). A kit comprising a combination of an Autoarapha californica nuclear polyhedrosis virus and baculovirus transfer 
vectors pAc700, pAc701 , pAc702, pVL1392 and pVL1393 is commercially available from Invitrogen Corp., San Diego, 

15 CA, USA. The insect cells used in the present invention are established insect cell lines, for example, Spodoptera fru- 
giperda . such as Sf21 or preferentially Sf9 (ATCC CRL171 1), but also Mamestra brassicae and the like. A Bombyx mori 
cell system using Bombyx mori nuclear polyhedrosis virus (BmNPV) can also be used in the present invention. 
[0062] The homologous recombination is carried out in accordance with a conventional procedure as described, for 
example, in "A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures, M.D. Summers et al., 

20 Texas Agricultural Experiment Station Bulletin No. 1555". The transfected insect cells are cultured in accordance with a 
conventional procedure. Namely, the transfected insect cells may be cultured in any tissue culture medium in which 
insect cells can grow, such as Grace's or TC100 medium supplemented with mammalian serum, serum-free medium 
EX-CELL400, or the like, at a temperature of 20°C to 30°C, preferably 27°C to 28°C, for example 27°C, for 2 to 10 days, 
preferably 3 to 5 days. 

25 [0063] The expressed prolylendopeptidase can be extracted from microbial cells such as E.coli cells or a supernatant 
of a cell culture by conventional methods, e.g., comprising homogenization of the cells, chromatography such as ion- 
exchange, hydrophobic or size-exclusion chromatography, precipitation, e.g., with ammonium sulfate or acid, prepara- 
tive electrophoresis such as polyacrylamide gel electrophoresis or isoelectric focussing, and the like. Particularly, pro- 
lylendopeptidase from Flavobacterium meninqosepticum . which is expressed in E. coli . is easily and selectively 

30 extracted from the cells with an osmotic shock method if the enzyme is secreted to the periplasmic region. The obtained 
crude enzyme can be further purified with usual methods, e.g. comprising chromatography such as ion-exchange, 
hydrophobic or size-exclusion chromatography, preparative electrophoresis such as polyacrylamide gel electrophore- 
sis, or isoelectric focussing, and the like. 

35 Production of C-terminally amidated peptides 

[0064] The present invention concerns also a method for the production of C-terminus amidated peptides by use of 
prolylendopeptidase. Prolylendopeptidase catalyzes not only the hydrolytic cleavage of a peptide at the C-terminus side 
of a proline residue, but also, forming the peptide bond in the reverse manner of the hydrolysis, the coupling of a peptide 

40 fragment to C-terminus of the other fragment which is terminated by a proline residue. Under controlled conditions the 
coupling reaction is predominant and prolylendopeptidase is used to catalyze coupling of two peptide fragments (or of 
amino acid to a peptide fragment). The preferable conditions of the coupling are an excess of one of peptide fragments 
(or an amino acid), and the presence of an organic solvent, such as glycerol, ethylene glycol, butanediol, ethanol, n- 
propanol, i-propanol, acetonitrile, DMF, and DMSO, in a high concentration, typically more than 50 %. 

45 [0065] In preferable embodiments of the present invention, biologically active peptides whose C-termini are a-ami- 
dated and have proline residues, preferably, at or near their C-termini are prepared with prolylendopeptidase from two 
precursors thereof, wherein one of the precursors is a precursor peptide forming N-terminal region of the amidated bio- 
active peptide and having a proline residue at its C-terminus and another precursor is a precursor peptide or amino acid 
forming a C-terminal portion of the amidated bioactive peptide which precursor peptide or amino acid has been ami- 

so dated at C-terminus. The a-amidated bioactive peptides prepared with prolylendopeptidase involve aspartocin, bermor- 
phin, calcitonin, CGRP, CGRP II, crustacean erythrophore concentrating hormone, cockroach myoactive peptide I, 
color change hormone, glumitocin, granuliberin-R, isotocin, LH-RH, mesotocin, morphine modulating neuropepride, a- 
MSH, oxytocin, phenypressin, SCP A , SCP B , valitocin, vasopressin, and vasotocin. 

[0066] The present invention provides a process for the production of a recombinant DNA molecule comprising a 
55 gene coding for prolylendopeptidase, comprising the steps of preparing cDNA or genomic DNA from cells, preferentially 
bacterial cells, capable of producing prolylendopeptidase, inserting DNA fragments coding for prolylendopeptidase into 
a cloning vector, and selecting a hybrid vector containing the DNA coding for prolylendopeptidase. 
[0067] The present invention concerns in particular the embodiments disclosed in the examples. 
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DESCRIPTION OF THE FIGURES 
[0068] 

s Fig. 1 represents restriction maps of the cloned inserts in pFPEP02 and pFPEP03. The open box represents the 

open reading frame of the prolylendopeptidase gene and the solid box a consensus sequence of the catalytic site 
of seine protease. Cloned inserts of pFPH6, pFPH5 and its deletion subclones (KD50, KD7 and KD6) are also 
aligned with the maps on the same scale. 

10 Figs. 2 to 4 represent a process for construction of expression plasmids pUK-FPEP-a and pUK-FPEP-b starting 
from an intermediate plasmid pFPEP04. 

EXAMPLES 

is [0069] The present invention will now be further illustrated by, but is no means limited to, the following examples. 
[0070] In the Examples, the following materials and methods are commonly used. 
[0071] The bacterial strains and plasmids used are listed in Table I. 



20 




Table 1 




Strains and plasmids. 




Strains or plasmid 


Relevant genotype 


25 


Strains 




E. Coli 






JM 83 


ara,A(lac-proAB), rpsL(=strA), 08O r , lacZ AM15 


30 


JM109 


recAl, endA1, gyrA96, thi, hsdR17, supE44, relAI, X t A(lac-proAB), F'[proAB + , lacl q , lacZ 
AM 15, traD36] 


HB101 


F", hsdS20(r" B , m- B ), recA13, ara-14, proA2, lacY1, galK2, rpsL20 (Snf), xyl-5, mtl-l, 
supE44, X, mcrA + , mcrB" 




TGI 


supE, hsd A5, thi, D(lac-proAB), F'[proAB + , lacF, lacZ A M15, traD36] 


35 


F. meninaosepticum 


IFO 12535 (ATCC 13253) 




Plasmids 






pUC19 


Amp r , lad', lacZ' 




pUC118 


Amp r , lad', lacZ', M13IG 


40 


pUC119 


Amp r , lad', lacZ', M13IG 



[0072] Transformation, restriction mapping, preparation of plasmids, and other molecular cloning procedures are 
done by standard methods. (Sambrook, J. et al. "Molecular cloning: a laboratory manual," 2nd ed. 1989, Cold Spring 

45 Harbor Laboratory, Cold Spring Harbor; Silhavy, T.J. et al. "Experiments with gene fusions," 1984, Cold Spring Harbor 
Laboratory, Cold Spring Harbor.) Restriction enzymes and DNA-modifying enzymes are used according to the recom- 
mendations of the manufacturers. Deletion with exonuclease III is carried out by use of a Kilo-Sequence Deletion kit 
(Yanisch-Perron, C. et al. Gene, 1985, 33, 103-119; Henikoff, S. Gene, 1984, 28, 351-359.). The nucleotide sequences 
are determined by the dideoxy method, by using a Sequenase kit. Genomic DNA from F meninaosepticum is isolated 

so by the method of Saito and Miura (Saito, H. et al. Biochim. Biophys. Acta, 1963, 72, 619-629). 

[0073] Restriction enzymes, DNA-modifying enzymes, the Kilo-Sequence Deletion kit and the MEGALABEL kit are 
purchased from Takara Shuzo Co. Ltd. (Kyoto). The Sequenase Ver.2.0 kit is the product of U.S. Biochemical Corp. 
(Cleveland, Ohio). Prolylendopeptidase from F. meninqosepticum and Endoproteinase Asp-N are purchased from Seik- 
agaku Corp. (Tokyo) and Boehringer Mannheim- Yamanouchi Co. Ltd. (Tokyo), respectively. The enzyme substrates, Z- 

55 Gly-Pro-p-naphthylamide and Z-Gly-Pro-P-nitroanilide, are obtained from Novabiochem AG(Laeufelfingen, Switzer- 
land). Radio isotopes are purchased from Amersham Japan Co. Ltd. (Tokyo) and other biochemicals are obtained from 
Sigma Chemical Co. (St. Louis, Missouri), Wako Pure Chemical Industries Ltd. (Osaka) and Nacalai Tesque Inc. 
(Kyoto). 
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Example 1 : Preparation of DNA probes 

[0074] Commercially obtained prolylendopeptidase is purified by reverse phase HPLC on a 4.6 x 35 mm TSKgel Octa- 
decyl NPR column (Tosoh Co. Ltd.). The column is eluted with 0.01 % TFA in water and a 3:1 mixture of CH 3 CN and i- 
5 PrOH, at a flow rate of 1 ml/min. The gradient from 35-70% of the organic solvent mixture is applied over 40 mm. and 
the major peak is collected. 

[0075] Since N-terminus of the endopeptidase is blocked, the enzyme must be subjected to proteolytic cleavage to 
determine its partial primary structure. The proteases commonly used for the cleavage like trypsin do not give satisfac- 
tory results. Therefore, proteases and conditions of the hydrolytic cleavage are systematically investigated and Endo- 

10 proteinase Asp-N is found to give the best result. 

[0076] The purified enzyme (0.5 mg) in 10 mM ammonium carbonate, pH 7.9, containing 4mM urea is hydrolyzed by 
1 jag of Endoproteinase Asp-N at 37°C for 24h. The peptide mixture obtained by this digestion is separated by reverse 
phase HPLC on a 4.6 x 250 mm Vydac C18 column (Separations Group Corp.) with the mobile phase of 0.01% TFA in 
water and a. 3:1 mixture of CH 3 CN and i-PrOH. The flow rate is 1 ml/min. The isolated peptides are further purified by 

15 rechromatography. The amino acid sequence of the purified fragments are determined by manual Edman degradation 
using the methods described by Kobayashi and Tarr (Kobayashi, R. et al. Tanpakushitsu Kakusan Koso, 1 986, 31 , 991 - 
1002; Tan, G.E. "Methods in protein sequencing analysis" (ed. Elzinga, M.), 1982, 223-232, Humana Press, New Jer- 
sey). 

[0077] The nucleotide sequences for the probes are not uniquely determined from the amino acid sequences because 
20 of multiple codon usage. Out of the 23 partial amino acid sequences six which give relatively less combinations of pos- 
sible nucleotide sequences are chosen to make DNA probes (Table II). Preferred codon usage in F. meninqosepticum 
has not been known, and two guidelines are adopted in the design of the nucleotide probes. Namely, three of the 6 
probes (A-12, 13 and 19) are designed so as to consist of a single oligonucleotide sequence, selecting the most prob- 
able codon for each amino acid residue on the assumption that the genome DNA of F. meninqosepticum is GC rich. The 
25 other three (A-3, 9 and 1 8) are mixtures of oligonucleotides of the possible sequences. To reduce further the number of 
the possible sequences in the mixture, inosine (I) is placed at the position which can be one of four bases, A, G, C and 
T, since inosine forms stable base pairs with all of four. 



Table II 





Determined partial amino acid sequences of the fragments of prolylendopeptidase obtained by the Endoprotein- 
ase Asp-N digestion (shown by the amino acid residur No. in SEQ ID No. 1), and corresponding nucleotide posi- 
tions in SEQ ID No. 1 of the probes designed from the amino acid sequences. 


35 


Fragment No. 


Amino acid residue No. in 
SEQ ID No.1 


Probe No. 


Corresponding nucleotide 
position in SEQ ID No. 1 




3 


499-509 


A-3 


1811-1833 




9 


352-364 


A-9 


1370-1407 


40 


12 


28-34 


A-12 


398-414 




13 


182-190 


A-13 


860-877 




18 


380-391 


A-18 


1454-1485 


45 


19 


268-276 


A-19 


1118-1137 



[0078] Oligonucleotides are synthesized with an Applied Biosystems Model 381 A DNA synthesizer. After removal of 
dimethoxytrityl group at the end of the synthetic sequence the oligonucleotides are deprotected and cleaved from the 
supports, according to the protocols of the manufacturer. The synthesized DNA are then subjected to preparative elec- 
so trophoresis with 8% polyacrylamide gel in 7M urea. Purified oligonucleotides are extracted from the separated bands 
and deionized by use of Waters Sep-Pack C-18 columns. 

Example 2: Evaluation of the probes 

55 [0079] The chromosomal DNA is isolated from F. meninqosepticum and digested by 4 kinds of commonly used restric- 
tion enzymes recognizing hexanucleotide sequence, i.e., Pstl, Hindlll, EcoRI and Bglll. 

[0080] Oligonucleotide probes are radio-labeled by use of a MEGALABEL kit with [y- 32 P]ATP to give a specific activity 
of ca. 1 x 1 0 6 cpm/pmol. The chromosomal fragments are electrophoresed on a 0.7% agarose gel and transferred to a 
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Millipore nitrocellulose filter by the method described by Sambrook et al. (Sambrook et al., 1989, supra). 
[0081] After prehybridization according to a standard protocol (Sambrook et al., 1989, supra), hybridization is carried 
out in 6x SSC hybridization solution with 0.2 pmol/ml of the labeled probe at 45°C for 16 h. The filter is washed with 6x 
SSC three times for 3 min. at room temperature and then once for 1 mm. at 45°C. Autoradiography is performed with a 
5 Fuji Bio-image analyzer BAS 2000. Only the A-3 probe is found to give a clear and specific signal with each of the 
digested DNA. 

Example 3 : Preparation of genomic library and screening thereof 

10 [0082] The molecular weight of prolylendopeptidase is found quite large, 76,000 by SDS-polyacrylamide gel electro- 
phoresis (Yoshimoto et al, 1980, supra). The size of the enzyme corresponds to 2 kb of the coding region in the 
genome. The larger the cloned DNA fragment is, the higher the chance of including the full length of the open reading 
frame. Therefore, rather a long fragment but small enough to get a high efficiency in the transformation is desired and 
7 kb of the Bglll fragment is selected. Namely, genomic DNA digested by Bglll is subjected to preparative electrophore- 

15 sis with low-melting-point agarose and the fraction of the gel containing 7 kb fragments is cut out. The excised gel piece 
is dissolved in a ligation mixture and the extracted chromosomal fragments are cloned into BamHI site of pUC19. By 
this ligation mixture E. coli HB101 is transformed to give a genomic library comprising about 4,000 recombinants. 
[0083] The genomic library is screened with the A-3 probe by colony hybridization and 119 positive clones are 
obtained. Sixteen positive clones are chosen and analyzed further by restriction endonuclease digestion and the 

20 enzyme assay. One clone with 7 kb insert is found to show a comparatively high prolylendopeptidase activity. The plas- 
mid is named pFPEP02 and further characterized. 

Example 4 : Restriction mapping and DNA sequencing of the isolated clones 

25 [0084] The restriction map of the insert of pFPEP02 is shown in Fig. 1 . To locate the coding region the insert DNA is 
cleaved by appropriate restriction enzymes and subcloned into pUC1 1 8 or pUC1 1 9. The clone which has 2.6kb of Hin- 
di - EcoRI fragment (pFPH6) shows the highest enzyme activity. To determine the entire nucleotide sequence of this 
fragment a series of deletion subclones are generated from either end of a larger fragment which includes the 2.6 kb 
fragment (pFPEP03, deposited as FERM BP-3466). From the deletion mutants and the subcloned restriction frag- 

30 ments, the whole sequence of the 2.6 kb fragment is determined by the dideoxy method (SEQ ID No. 1). 

[0085] The prolylendopeptidase gene is represented by the open box in Fig. 1. The gene is found to have an open 
reading frame of 2,1 18 bp, which is preceded by a putative promoter sequence separated by 28 bp from the ATG initi- 
ation codon. The endopeptidase, predicted from the nucleotide sequence, consists of 705 amino acid residues with a 
calculated molecular weight of 78,700, in good agreement with the value of 77,500 determined by sedimentation equi- 

35 librium (Yoshimoto, T. et al. Agric. Biol. Chem. 1982, 46, 2157-2158). The enzyme has been thought a serine protease 
based on the inhibitor study (Yoshimoto et al., 1980, supra). In agreement with the assumption there is a consensus 
sequence of catalytic site of seine protease, Gly-X-Ser-X-Gly, in the c-terminal region. The G-C content of the cloned 
Hindi- EcoRI fragment is 38.4% and rather low, contrary to expectations. In the protein data banks, NBRF-PIR and 
SWISS-PROT, no protein which has a significant homology with prolylendopeptidase is found. 

40 

Example 5 : Expression of prolylendopeptidase 

[0086] For the expression of prolylendopeptidase, E. coli transformed with a plasmid is cultured in TY medium at 37°C 
with agitation in a rotary shaker (120 rpm). TY-broth used for expression of prolylendopeptidase in E. coli contains 1% 
45 Bacto-tryptone, 0.1% Bacto-yeast extract, 0.1% glucose, 0.8% NaCI, pH 7. F. meningosepticum is grown in a polypep- 
tone medium which contains 1% polypeptone, 0.2% Bacto-yeast extract, 0.1% MgS0 4 7H 2 0 and 2.5% NaCI (pH7), 
according to instructions of a culture collection. 

[0087] Two assay methods (A and B) are employed for qualitative estimation of the enzyme activity and for the quan- 
titative evaluation, respectively. In the method A, Z-Gly-Pro-p-naphthylamide (Z = N-benzyloxycarbonyl) is used as a 

50 substrate. To 0.8 ml of 20 mM Tris-HCI buffer, pH 7.0, is added 0.1 ml of E. coli culture or diluted cell suspension and 
the mixture was preincubated at 37°C for 3 min. The reaction is started by the addition of 0.1 ml of the substrate solution 
(5 mM) in 40% dioxane and terminated after 10 min. by the addition of 0.5 ml Fast Garnet GBC solution (1 mg/ml) con- 
taining 10% Triton X-100 in 1 M acetate buffer, pH 4.0. The reaction mixture is left at room temperature for 20 min. and 
centrifuged at 12,000 g for 5 min. The absorbance of the supernatant is measured at 550 nm. 

55 [0088] In the quantitative assay (B), E. coli cells are harvested by centrifugation, washed with 0. 1 M HEPES (pH 7.4), 
and suspended in the same volume of the HEPES solution as the culture. The cell suspension is sonicated on ice for 
60 s with intervals over 3 min. to give a cell lysate. To 0.94 ml of 0.1 M potassium phosphate buffer, pH 7.0, is added 
0.05 ml of 4 mM Z-Gly-Pro-g-nitroanilide in 40% dioxane. After 3 min. preincubation at 30°C, 0.01 ml of the diluted 
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lysate is added to the mixture and the change of the absorbance is followed at 41 0 nm with a Hitachi spectrophotometer 
U-3210 at 30°C. One unit of the enzyme activity is defined as the amount of the enzyme that releases 1 jimol of p_- 
nitroaniline per minute, corresponding to 8.87 OD/min. with this standard procedure. 

[0089] Although the clone having the Hincll-EcoRI fragment (pFPH6) clearly shows the activity of prolylendopepti- 

5 dase, the activity is much smaller than that of the original bacterium. To improve the expression level, another set of 
deletion mutants are prepared from the clone having Hincll-BamHI fragment (pFPH5). As the deletion is extended from 
the 5' end of the fragment, the enzyme activity increases gradually to reach maximum with pFPH5-KD50 (Fig. 1). And 
then the activity decreases but still moderate (pFPFH5-KD7), when the deletion reaches to the initial part of the reading 
frame. Very little activity is detected from a clone with further deletion (pFPH5-KD6). Therefore, the expression of the 

10 enzyme in KD50 is further investigated in detail. 

[0090] The plasmid pFPH5-KD50 contains 120 bp of upstream noncoding region comprising the putative promoter 
sequence, together with the full length of the open reading frame. E. coN (JM83) transformed by this plasmid is grown 
in TY medium, and the time-course of the expression of prolylendopeptidase is followed with the enzyme activity of the 
total protein in homogenate of the washed cell (Table III). Around 6 to 12 h, as the growth of the bacterium stops, total 

15 activity of the enzyme increases rapidly and reaches the maximum value of 3,371 units/L, corresponding to 1 0 times of 
the enzyme activity attained by F. meningosepticum (346 units/L). The total activity stays constant until 24 h and then 
slowly decreases. On the other hand, the specific activity shows the rapid increase around 6 to 12 h similar to the total 
activity, but it increases gradually even after 24 h. A maximal specific activity of 10.6 units/mg protein is reached around 
36 h. Since typical specific activity of purified prolylendopeptidase is about 115 units/mg protein, the expression level of 

20 prolylendopeptidase in this clone amounts to about 1/10 of the total protein. 

[0091] Such a high expression level of prolylendopeptidase is also demonstrated in the SDS-PAGE analysis. Sample 
for SDS-PAGE is prepared by mixing the lysates described before with an equal volume of sample buffer containing 
SDS and p-mercaptoethanol and incubated at 96°C for 5 min. The electrophoresis is performed using a precast plate 
(12.5%, 84 x 90 x 1.0 mm) and the apparatus obtained from Daiichi Pure Chemicals Co. Ltd. (Tokyo), according to the 

25 protocols given by the manufacturer. The gel is stained with Coomassie Brilliant blue R250. 

[0092] At 12 h appearance of a new band is clearly distinguished at the same position as the standard of prolylen- 
dopeptidase from F. meningosepticum . Intensity of this band is highest around 12-24 h, concomitantly with the change 
of the total activity of the enzyme. 

[0093] The production of prolylendopeptidase is shown in the following Table III. 

30 



Table III 



Expression of prolylendopeptidase in E. coli (JM83) harboring pFPH5-KD50. 


Time (h) 


Absorbance at 550 nm in 
culture medium 


Total activity (unit/L) 


Specific activity (unit/mg 
protein) 


0 


0.12 


0 


0 


6 


4.14 


572 


2.2 


12 


6.06 


3371 


8.8 


24 


5.75 


3388 


9.7 


36 


5.20 


3003 


10.6 


48 


4.69 


2494 


10.5 



Example 6: Further improvement of expression 

[0094] The expression in E. coli JM 83 harboring pFPH5-KD50 has reached quite a high level, but an even higher 
so level would be preferable for the industrial production of prolylendopeptidase. In pFPH5-KD50 a putative promoter 
sequence, originating from R meningosepticum . is found and assumed to function in E. coli . To improve the expression 
level further, this native promoter is replaced by the strong trp-lac hybrid promoter (or tac promoter), as follows. 
[0095] Plasmid pFPEP02 is digested with Hindi to obtain a 3.1 kbp Hindi fragment containing prolylendopeptidase 
gene, which is then subcloned at the Smal site of pUC1 1 8 by blunt end ligation to give pFPEP04. After the ligation, the 
55 insertion points at the both ends of the fragment are cleavable neither by Hindi nor Smal (Fig. 2). To delete the Seal 
and Pvull sites in the coding region the synthetic double-stranded oligonucleotide fragment which corresponds to the 
sequence between the Smal and Pvull sites but is mutated at two positions (Synthetic Fragment I, see SEQ ID No. 2) 
is prepared by annealing the lower strand and the upper strand only whose 5' end has been phosphorylated beforehand 
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by T4 polynucleotide kinase. 

[0096] On the other hand, the plasmid pFPEP04 is cleaved at the single Smal site existing in the open reading frame, 
and the mutated Smal - Pvull fragment mentioned above is ligated to the linearized plasmid at both terminus Smal 
sites. The ligation product is then digested by Sad and the longer fragment, containing the 5' portion of the coding 
s region, is isolated by agarose gel electrophoresis. After kinasing the isolated fragment the missing piece between Pvull 
and Sacl sites, prepared separately from pFPEP04, is ligated to construct plasmid pFPEP04\ Since one nucleotide at 
the terminus generated by Pvull has been changed in the synthetic fragment, the Pvull site is not regenerated by this 
cyclization. 

[0097] Next, a new EcoRI site is created immediately upstream from the initiation codon of the prolylendopeptidase 
10 gene as follow (see Fig. 3). Synthetic Fragment II (SEQ ID No.3) is prepared by ligation of the four oligonucleotides U1 , 
U2, L1 and L2 (see SEQ ID No.4, 5, 6 and 7, respectively), where two of them (U2 and L1) have been phosphorylated 
at their 5' termini. The prepared fragment corresponds from the initiation codon to Pvull site in the upstream coding 
region and has protruding cohesive 5' terminus immediately upstream of the initiation site to introduce EcoRI site after 
ligation. For the following ligation the both 5' ends of the prepared fragment are phosphorylated by T4 polynucleotide 
is kinase. 

[0098] The plasmid pFPEP04', in which one of four Pvull has been deleted, is digested with Pvull, and the fragment 
containing most of the coding region is isolated by agarose gel electrophoresis and ligated to the second synthetic frag- 
ment described above. The product obtained by the ligation is digested with EcoRI to isolate the complete open reading 
frame with the 5' protruding cohesive ends at both termini, which is then subcloned at the EcoRI site of pUC1 19. The 
20 two synthetic regions in the obtained plasmid, pFPEP-EE, are sequenced and the mutated nucleotide sequences are 
confirmed. The resulting plasmid is pFPEP-EE. 

[0099] In the next step of the vector construction, as shown in Fig. 4, the whole coding region, together with a short 
downstream non-coding region, is cleaved out from pFPEP-EE by EcoRI. The fragment is then inserted into the EcoRI 
site of the expression vector, pKK223-3, to provide pKK-FPEP in which the transcription of the prolylendopeptidase 

25 gene is under the control of the tac promoter. 

[0100] The replication origin of pKK223-3 originates from pBR322 and the copy number of this expression vector in 
a single cell is usually low. Because of the higher dose effect of the gene a high copy number plasmid is preferable as 
an expression vector for a higher expression level. Therefore, a) a set of the promoter and the coding region or b) a set 
of the promoter, the coding region and the terminator is excised by BamHI or Bbill, respectively, and transplanted into 

30 the high copy number plasmid, pUC1 1 9. Since the peptidase gene is transferred from pKK-FPEP together with the tac 
promoter, the original jac promoter of pUC1 19 is removed by Pvull in order to avoid the double promoter. In between 
the blunt ends generated by this Pvull digestion either the gene set a) or the set b), which has been blunted by T4 DNA 
polymerase, is inserted to give pUK-FPEP-a or pUK-FPEP-b, respectively (Fig. 4). 

[0101] For overexpression of prolylendopeptidase, E. coli . JM 109, is transformed by pUK-FPEP-a or pUK-FPEP-b. 

35 The transformants are selected on LB plates containing 50 jug/ml of ampicillin and grown in LB medium with the ampi- 
cillin overnight. For expression of prolylendopeptidase 100 ml of CIRCLEGROW (BIO 101, Inc., Vista, California) 
medium (without ampicillin) is inoculated with 2 ml of the overnight culture and shaken at 37°C. In a preliminary exper- 
iment the maximum expression level in the transformant harboring pUK-FPEP-a (13,500 units/L) is found twice as high 
as that for pUK-FPEP-b (6,600 units/L) when the expression is boosted with 1 mM IPTG on a half day after the inocu- 

40 lation. Without the IPTG the basal expression levels for both clones are still quite high; in the case of pUK-FPEP-b it is 
even higher (8,060 units/L) than the level obtained by the addition of IPTG. 

[0102] Since pUK-FPEP-a shows the highest expression level, reaching 39 times of the enzyme activity obtained by 
F. meninqosepticum and 4 times of the expression level attained with the pFPH5-KD50, the time course of the expres- 
sion is followed with or without the addition of IPTG (Table IV)- As the growth, monitored by the absorbance of the cul- 

45 ture broth at 550 nm, slowed down around 4 h after the inoculation, the activity of the enzyme increases linearly along 
the time course. Without the boosting the expression by the IPTG the activity increases further linearly up to 20 h to 
attain the maximum level of 7,400 units/L at 24 h and then decreases. The specific activity of the endopeptidase stayed 
constant around 1 1 units/mg protein after 12 h. On the other hand, when 1 mM IPTG is added at 12 h, the increase is 
clearly enhanced immediately after the addition, reaching the highest expression level of 13,500 units/L at 28 h. A sig- 

so nificant increase in the specific activity is also seen simultaneously with the sharp increase in the expression level and 
the maximum specific activity, 40 units/mg protein, is observed at 28 h and later, clearly demonstrating the effect of the 
IPTG induction on the tac promoter. Since a typical pure preparation of prolylendopeptidase from F. meningosepticum 
was reported to show a specific activity of 1 1 5 units/mg protein (Yoshimoto, T.et al. , 1 978 supra), the expressed enzyme 
accounts for over 30% of the total protein extracted from E. coli . 

55 [0103] Such a high expression of prolylendopeptidase is also demonstrated in SDS-PAGE analysis. Already at 6 h 
after the inoculation, appearance of a new band is clearly noticed at the same position as the standard of prolylen- 
dopeptidase, and the expansion of that band demonstrates the increase in the expression level clearly during the time 
course from 12 h to 28 h, concomitantly with the change of the total activity of the enzyme. 
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55 Example 7 : Production of recombinant prolylendopeptidase 

[0104] Since prolylendopeptidase is expressed in E. coli in a soluble and active form and the expression level is so 
high (more than 30% of the total extracted protein), the isolation of the expressed enzyme is quite straightforward. 
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[0105] E. coli cells transformed with the expression plasmid pUK-FPEP-a are cultured in the CIRCLEGROW medium 
with boosting by IPTG for 28 hours at 37 °C. E. coli cells are harvested by centrifugation (3,000 g for 10 min) and 
washed with cold 0.1 M HEPES buffer, pH 7.4. The washed cells are re-suspended in 0.1 M HEPES and disrupted by 
sonication intermittently over 20 mm with a SONIFIER 450 (BRANSON Sonic Power Co.). The lysate is then fraction- 

5 ated with ammonium sulfate precipitation. The protein precipitated at 65 - 90% saturated ammonium sulfate is dissolved 
in 20 mM phosphate buffer, pH 6.2 and dialyzed against the same buffer. The dialyzate is applied to a CM52 column 
(Whatman BioSystems Ltd.) equilibrated with the same buffer. The enzyme is eluted by a linear gradient of NaCI. Active 
fractions are combined, concentrated by ultrafiltration with a Amicons cell and a YM30 membrane (Amicon Div. p 
W.R.Grace & Co.) and dialyzed against 20 mM phosphate buffer, pH 6.8. Prolylendopeptidase is further purified on a 

10 Mono S HR 10/10 column (Pharmacia LKB Biotechnology AB). With a NaCI gradient (0 - 0.075 M) the endopeptidase 
is eluted in a sharp peak around 0.035 M of NaCI. The purified enzyme appears to be homogeneous yielding only a 
single band in SDS-PAGE. 

Example 8 : Production of luteinizing hormone-releasing hormone (LH-RH) 

15 

[0106] Since LH-RH (a decapeptide) has a proline residue at the penultimate position, it is prepared from the non- 
apeptide precursor and glycine-amide. In the presence of a catalytic amount of prolylendopeptidase (0.08 nM), 1 mM 
of the precursor and 2.0 M glycineamide are incubated in 60% glycerol at pH 7.0 and 30°C. In 48 h the coupling comes 
to equilibrium with the hydrolysis and LH-RH is obtained in the conversion of 67% in a quantitative yield (97% isolation 
20 yield). The rest of the precursor is recovered in HPLC purification and no side reaction is noticed. 

Example 9 : Production of oxytocin 

[0107] Oxytocin, a nonapeptide having a proline residue at the third position from its c-terminus, is obtained by cou- 
25 pling the precursor of the first seven residues to leucylglycine-amide with prolylendopeptidase. In a typical example 1 
mM of the oxytocin precursor [1-7] and 0.8 M leucylglycine-amide are incubated with 0.13 \iM prolylendopeptidase in 
60% glycerol at pH 6.5 and 30°C. The coupling proceeds to reach an equilibrium after 48 h and 55% of the precursor 
is converted to oxytocin in a quantitative yield (91% isolation yield). No by-product is detected and 43% of the starting 
material is recovered. 

30 

Example 10: Alternative method for purification of recombinant prolylendopeptidase 

[0108] E. coli cells expressing prolylendopeptidase are harvested by centrifugation (10,000 g for 10 min) and washed 
with cold 0.1 M Tris-HCI buffer, pH 8.0. The washed cells are re-suspended in 0.5 M sucrose solution containing 5 mM 

35 EDTA buffered at pH 8.0 with 0.1 M Tris-HCI. Lysozyme (160 jug/ml) is added to the suspension and the mixture is left 
on ice for 2 min before dilution with the same volume of ice-cold water. The diluted cell suspension is further left on ice 
for 30 mm and then centrifuged at 10,000 g for 20 min. The supernatant is diluted again with the same volume of ice- 
cold water and its pH is adjusted to 7.0 with 1 N NaOH. The obtained crude enzyme solution (a periplasma fraction) is 
directly applied to a CM 52 column (Whatman BioSystems Ltd.) equilibrated with 20 mM phosphate buffer, pH 6.8. The 

40 enzyme is eluted in a single peak with a linear gradient (0 - 0.25 M) of NaCI. In this alternative purification method with 
the osmotic shock procedure, prolylendopeptidase is purified to homogeneity as judged by SDS-page analysis with a 
single chromatography step. 

Example 1 1 : Properties of recombinant prolylendopeptidase from Flavobacterium meningosepticum expressed in E. 

45 COli 

[0109] The method (B) described in Example 5 is modified to assay purified prolylendopeptidase. DTT (1 mM) and 
BSA (1 00 jug/ml) are included in the buffer solution of 0. 1 M phosphate (pH 7.0) for the assay to improve reproducibility 
and accuracy of the kinetic measurement. To 0.94 ml of this buffer solution is added 0.05 ml of 4 mM Z-Gly-Pro-p- 
50 nitroanilide in 40 % dioxane. After 3 min preincubation at 30°C, 0.01 ml of the enzyme solution obtained in example 10 
is added to the mixture and changes of the absorbance at 410 nm is followed at 30°C. 

[0110] In a typical purification the enzyme with a specific activity higher than 120 units/mg protein is obtained. The 
molecular weight is estimated to be 71,000 and 74,000 by gel filtration with a TSK-GEL G3000SW column (Tosoh 
Corp.) and SDS-PAGE, respectively. The enzyme shows an extinction coefficient E(1%/280 nm) of 15.5. There are two 
55 cysteine residues according to the deduced amino acid sequence with SEQ ID No. 1, but no free sulfhydryl group is 
detectable by titration with p-chloromercuribenzoic acid under a denatured condition with 8 M urea. 
[0111] The N-terminal sequence of the recombinant prolylendopeptidase is found to start with Ala and follows with 
Gin, Asn, Ser, Asn, X (unknown), Leu, Lys, Tyr, and Pro, proving that the first 19 amino acid residues shown in the 
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sequence with SEQ ID No. 1 encode a signal sequence and are missing from the N-terminus of the matured enzyme. 
Deposited Microorganisms 

5 [01 1 2] E. coli TG1/pFPEP03 was deposited under the Budabest Treaty with with the Fermentation Research Institute, 
Agency of Industrial Science and Technology (FRI), 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki, Japan, on July 5, 
1991, as FERM BP-3466. 

10 
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25 



SEQUENCE LISTING 
SEQ ID No. 1 

SEQUENCE TYPE: Nuceleotide with corresponding protein 
SEQUENCE LENGTH: 2636 base pairs 
w STRANDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: genomic DNA 
ORIGINAL SOURCE 

15 

ORGANISM: Fl avobacterium meningosept icum 
INTERMEDIATE EXPERIMENTAL SOURCE 
NAME OF PLASMID: pFPEP03 
20 FEATURE: genomic DNA coding for proly lendopept idase 

from 1 to 259 promoter region 
from 260 to 316 signal sequence 
from 317 to 2374 mature prolylendopept idase 

coding region 
from 2375 to 2377 stop codon 

30 GTTGACGGTA AAGTAGTATT T AC T AAAAAG AGAGATAACA GGTCTTACGT 50 

ATCTGTAGCA CCAGATGCTT AAT TAAAGCA TTTTATAAAA ATTAAAACCT 100 

CAACGAAAGT TGAGGTTTTT TTTGTCTCAA AAACCTAACA GGTTTCTGAA 150 

ACCTGTTAGG TTTATTGTGT ATAGGGGTTA AG T GAT AC AT ATTTATACTG 2 00 

35 

TGCTGAAATG CGAATCTGAT TATTCGAAAA TTCTCCCTAT TTTTGATAAA 250 

ACCAATTCT ATG AAG TAC AAC AAA CTT TCT GTG GCA GTT GCA GCC 2 95 
Met Lys Tyr Asn Lys Leu Ser Val Ala Val Ala Ala 
40 -19 -15 -10 

TTT GCT TTT GCA GCT GTA TCA GCA CAA AAT TCT AAT GTT TTG 337 

Phe Ala Phe Ala Ala Val Ser Ala Gin Asn Ser Asn Val Leu 

45 

-5 -1 +1 5 

AAA TAT CCC GAA ACT AAA AAA GTA AGC CAT ACC GAT ACC TAT 37 9 
so Lys Tyr Pro Glu Thr Lys Lys Val Ser His Thr Asp Thr Tyr 

10 15 20 



55 
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55 



TTT GGT ACT CAG GTA TCC GAT CCT TAT CGC TGG CTG GAA GAC 421 

Phe Gly Thr Gin Val Ser Asp Pro Tyr Arg Trp Leu Glu Asp 

25 30 35 

GAC AGA GCC GAA GAT ACA AAA GCC TGG GTA CAA CAG GAA GTT 4 63 

Asp Arg Ala Glu Asp Thr Lys Ala Trp Val Gin Gin Glu Val 

40 45 

AAA TTT ACA CAA GAC TAC CTT GCA CAG ATT CCT TTC CGT GAT 505 

Lys Phe Thr Gin Asp Tyr Leu Ala Gin lie Pro Phe Arg Asp 

50 55 60 

CAG CTT AAA AAG CAA TTA ATG GAC ATC TGG AAT TAT GAG AAA 547 

Gin Leu Lys Lys Gin Leu Met Asp lie Trp Asn Tyr Glu Lys 

65 70 75 

ATT TCA GCA CCG TTT AAA AAA GGT AAA TAC ACC TAT TTT TCT 58 9 

lie Ser Ala Pro Phe Lys Lys Gly Lys Tyr Thr Tyr Phe Ser 

80 85 90 

AAA AAT GAT GGT CTT CAG GCG CAA TCT GTA CTT TAC AGA AAA 631 

Lys Asn Asp Gly Leu Gin Ala Gin Ser Val Leu Tyr Arg Lys 

95 100 105 

GAT GCG GCA GGT AAG ACG GAA GTA TTT TTA GAT CCT AAT AAG 67 3 

Asp Ala Ala Gly Lys Thr Glu Val Phe Leu Asp Pro Asn Lys 

110 115 

TTT TCG GAA AAA GGA ACC ACT TCT CTG GCA AGT GTT TCT TTT 715 

Phe Ser Glu Lys Gly Thr Thr Ser Leu Ala Ser Val Ser Phe 

120 125 130 

AAT AAA AAA GGA ACT CTG GTC GCT TAT AGT ATA TCA GAA GGA 75 7 

Asn Lys Lys Gly Thr Leu Val Ala Tyr Ser lie Ser Glu Gly 

135 140 145 
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GGT TCG GAC TGG AAT AAG ATT ATT ATT CTG GAT GCG GAA ACC 7 99 
Gly Ser Asp Trp Asn Lys lie lie lie Leu Asp Ala Glu Thr 
150 155 160 

AAA AAG CAA CTT GAT GAA ACT CTA TTG GAT GTT AAG TTC AGT 841 
Lys Lys Gin Leu Asp Glu Thr Leu Leu Asp Val Lys Phe Ser 
165 170 175 

GGA ATT TCA TGG TTG GGA GAT GAA GGA TTC TTT TAT TCC AGC 883 
Gly lie Ser Trp Leu Gly Asp Glu Gly Phe Phe Tyr Ser Ser 
180 185 

20 TAT GAT AAG CCA AAA GAA GGA AGC GTA CTT TCC GGG ATG ACA 925 

Tyr Asp Lys Pro Lys Glu Gly Ser val Leu Ser Gly Met Thr 
190 195 200 



10 



15 
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GAT AAA CAC AAA GTT TAT TTT CAT AAG TTA GGA ACG AAG CAG 9 67 

Asp Lys His Lys Val Tyr Phe His Lys Leu Gly Thr Lys Gin 
205 210 215 

TCT CAG GAT GAA TTG ATT ATT GGG GGT GAT AAA TTT CCA AGA 10 09 

Ser Gin Asp Glu Leu He lie Gly Gly Asp Lys Phe Pro Arg 

220 225 230 

AGA TAT ATA GGA GCT TAT GTA ACC GAT GAT CAG AGA TAT CTG 1051 

Arg Tyr He Gly Ala Tyr Val Thr Asp Asp Gin Arg Tyr Leu 
235 240 245 

GTG GTT TCG GCT GCA AAT GCA ACC AAC GGA AAC GAG CTT TAC 10 93 

Val Val Ser Ala Ala Asn Ala Thr Asn Gly Asn Glu Leu Tyr 
250 255 



ATT AAA GAC CTG AAG AAT AAA ACA GAT TTT ATT CCG ATT ATT 1135 
He Lys Asp Leu Lys Asn Lys Thr Asp Phe He Pro He He 
so 260 265 270 



55 
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ACA GGT TTT GAT AGC AAT GTA AAT GTT GCA GAT ACC GAC GGT 117 7 
Thr Gly Phe Asp Ser Asn Val Asn Val Ala Asp Thr Asp Gly 
5 275 230 285 

GAT ACG CTT TAT TTG TTC ACC GAT AAA GAT GCA CCG AAT AAG 1219 
10 Asp Thr Leu Tyr Leu Phe Thr Asp Lys Asp Ala Pro Asn Lys 

290 295 300 
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CGA CTG GTA AAA ACA ACG ATT CAG AAT CCA AAA GCG GAA ACA 12 61 
Arg Leu Val Lys Thr Thr lie Gin Asn Pro Lys Ala Glu Thr 
305 310. 315 

TGG AAA GAT GTG ATT GCT GAA ACC ACC GAA CCA TTC CAA ATC 130 3 
Trp Lys Asp Val lie Ala Glu Thr Ser Glu Pro Leu Glu lie 

320 325 

AAT ACG GGA GGC GGT TAT TTC TTT GCT ACT TAT ATG AAA GAT 134 5 

Asn Thr Gly Gly Gly Tyr Phe Phe Ala Thr Tyr Met Lys Asp 
330 335 340 

GCA ATC GAT CAG GTA AAG CAA TAT GAT AAA AAC GGA AAG CTT 13 87 

Ala lie Asp Gin Val Lys Gin Tyr Asp Lys Asn Gly Lys Leu 
345 350 355 

GTA AGG GCT ATA AAA TTA CCG GGA AGT GGT AAT GCA AGC GGT 14 29 
Val Arg Ala lie Lys Leu Pro Gly Ser Gly Asn Ala Ser Gly 
360 365 370 

TTT GGG GGT GAA AAA ACG GAA AAG GAT CTG TAT TAC TCT TTC 1471 
Phe Gly Gly Glu Lys Thr Glu Lys Asp Leu Tyr Tyr Ser Phe 
375 380 385 



ACC AAT TAT ATT ACG CCG CCA ACG ATC TTT AAA TAT AAT GTA 1513 

so Thr Asn Tyr lie Thr Pro Pro Thr lie Phe Lys Tyr Asn Val 

390 395 
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ACA ACA GGT AAT TCT GAA GTT TAC CAG AAG CCG AAA GTG AAG 15 55 

Thr Thr Gly Asn Ser Glu Val Tyr Gin Lys Pro Lys Val Lys 
400 405 410 

TTC AAT CCG GAA AAT TAT GTT TCG GAG CAG GTA TTC TAT ACT 15 97 

10 Phe Asn Pro Glu Asn Tyr Val Ser Glu Gin Val Phe Tyr Thr 

415 420 425 

TCA TCT GAC GGG ACT AAG ATT CCG ATG ATG ATC AGC TAC AAG 163 9 

15 

Ser Ser Asp Gly Thr Lys lie Pro Met Met lie Ser Tyr Lys 

430 435 440 

20 AAA GGC CTG AAA AAA GAC GGT AAA AAC CCT ACA ATA TTA TAC 1681 

Lys Gly Leu Lys Lys Asp Gly Lys Asn Pro Thr lie Leu Tyr 

445 450 455 



25 



30 



35 



45 



50 



AGC TAC GGA GGA TTT AAT ATC AGT CTT CAG CCT GCT TTC TCT 1723 
Ser Tyr Gly Gly Phe Asn lie Ser Leu Gin Pro Ala Phe Ser 
460 465 

GTT GTA AAT GCA ATC TGG ATG GAA AAC GGT GGT ATT TAT GCT 17 65 
Val Val Asn Ala He Trp Met Glu Asn Gly Gly He Tyr Ala 
470 475 480 



GTT CCG AAT ATC CGT GGT GGT GGA GAA TAT GGT AAG AAA TGG 18 07 

Val Pro Asn He Arg Gly Gly Gly Glu Tyr Gly Lys Lys Trp 

40 485 490 495 

CAT GAT GCC GGA ACT AAA ATG CAG AAA AAG AAT GTA TTT AAT 184 9 

His Asp Ala Gly Thr Lys Met Gin Lys Lys Asn Val Phe Asn 
500 505 510 



GAC TTT ATT GCA GCC GGA GAG TAC TTA CAG AAA AAC GGT TAT 18 91 
Asp Phe He Ala Ala Gly Glu Tyr Leu Gin Lys Asn Gly Tyr 
515 520 525 



55 
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ACA TCT AAG GAA TAT ATG GCG CTT TCC GGA CGT TCC AAC GGC 1933 

Thr Ser Lys Glu Tyr Met Ala Leu Ser Gly Arg Ser Asn Gly 
53 0 5 35 

GGT CTT CTT GTA GGG GCT ACG ATG ACA ATG CGC CCT GAT TTG 197 5 

10 Gly Leu Leu Val Gly Ala Thr Met Thr Met Arg Pro Asp Leu 

540 545 550 

GCA AAA GTT GCA TTC CCG GGA GTA GGA GTA CTG GAT ATG CTT 2017 

15 

Ala Lys Val Ala Phe Pro Gly Val Gly Val Leu Asp Met Leu 
555 560 565 

20 CGT TAT AAT AAG TTT ACA GCT GGT GCC GGT TGG GCT TAT GAT 2 05 9 

Arg Tyr Asn Lys Phe Thr Ala Gly Ala Gly Trp Ala Tyr Asp 
570 575 580 



25 



30 



35 



TAC GGT ACA GCA GAA GAC AGC AAG GAA ATG TTT GAA TAC CTG 2101 
Tyr Gly Thr Ala Glu Asp Ser Lys Glu Met Phe Glu Tyr Leu 
585 590 595 

AAG TCT TAT TCT CCG GTA CAT AAC GTA AAA GCC GGA ACT TGT 214 3 
Lys Ser Tyr Ser Pro Val His Asn Val Lys Ala Gly Thr Cys 
600 605 



TAT CCT TCT ACG ATG GTC ATT ACA AGT GAT CAT GAT GAC AGA 2185 
Tyr Pro Ser Thr Met Val lie Thr Ser Asp His Asp Asp Arg 
40 610 615 620 

GTT GTT CCC GCT CAT TCA TTT AAG TTC GGT TCA GAA TTA CAG 2227 
Val Val Pro Ala His Ser Phe Lys Phe Gly Ser Glu Leu Gin 
625 630 635 

GCA AAA CAA TCT TGT AAG AAT CCT ATT CTT ATC CGT ATT GAA 22 69 
so Ala Lys Gin Ser Cys Lys Asn Pro lie Leu lie Arg lie Glu 

640 645 650 



55 
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ACA AAT GCT GGA CAC GGA GCA GGA CGT TCC ACA GAA CAG GTC 2311 
Thr Asn Ala Gly His Gly Ala Gly Arg Ser Thr Glu Gin Val 
655 660 665 

GTT GCT GAG AAT GCC GAT CTG CTT TCA TTC GCA TTA TAT GAA 2353 
Val Ala Glu Asn Ala Asp Leu Leu Ser Phe Ala Leu Tyr Glu 
670 675 

ATG GGA ATT AAA AGT TTA AAA TAG ATTTCAAATA CTAAATATAA 23 97 
Met Gly lie Lys Ser Leu Lys 
680 685 

AACAGGCAGG TCTTTTTGAT TTGCCTGTTT TTTTATGATA CTATTGAGTT 24 4 7 

TGGATTATGT TAAATAGATT AGATCATGAG AT T TAT AT C T CAGGAAATGA 24 97 

TTAACTTTAA T AC AAAATC T T AT AC AAT GG AAAAT CAT G A CATGACAACT 25 4 7 

TTAGTACAGG T AAT G AAT AC TTTGAAAAGA AGAGGCGTGG ACAAAGAAAT 2 5 97 

CCAGATGACA GATGATAGGA AATTTATACT TCAGAATTC 2 63 6 

SEQ ID No. 2 

SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 4 4 base pairs 

STRANDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

GGGAGTAGGA GTTCTGGATA TGCTTCGTTA TAATAAGTTT ACTG 4 4 



SEQU ID No. 3 
SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 54 base pairs 
STRANDNESS: double 
45 TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



10 



15 



20 



25 



30 



35 



40 



50 



55 



AATTCATGAA GTACAACAAA CTTTCTGTGG CAGTTGCAGC CTTTGCTTTT 50 
GCAG 54 
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SEQ ID No. 4 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base 
STRANDNESS: single 
TOPOLOGY; linear 
MOLECULE TYPE: synthetic DNA 

AATTCATGAA G T AC AAC AAA CTT 

SEQ ID No. 5 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 31 base 
STRANDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

TCTGTGGCAG TTGCAGCCTT TGCTTTTGCA G 

SEQ ID No. 6 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 25 base 
STRANDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

CACAGAAAGT TTGTTGTACT TCATG 

SEQ ID No. 7 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 25 base 
STRANDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

CTGCAAAAGC AAAGGCTGCA ACTGC 



24 



EP 0 967 285 A1 



Claims 



1. A recombinant DNA molecule comprising a DNA sequence encoding the Fiavobacterium meningosepticum pro- 
lyl endopeptidase shown in SEQ ID No. 1. 

5 

2. A recombinant DNA molecule according to claim 1 comprising the coding region shown in SEQ ID No. 1 . 

3. A recombinant DNA molecule according to claim 2 which is plasmid pFPEP03 (FERM BP-3466). 

10 4. A recombinant DNA molecule according to claim 1 or claim 2 which is an expression vector comprising a DNA 
sequence coding for prolyl endopeptidase operably linked to an expression control sequence. 

5. A recombinant DNA molecule according to claim 4 wherein the DNA sequence encoding prolylendopeptidase 
operably linked to an expression control sequence is the nucleic acid sequence shown in SEQ ID No. 1 , extending 

is 1 20bp upstream of the translation initiation site and to the BamHI site downstream of the stop codon. 

6. A recombinant DNA molecule according to claim 4 wherein the expression control sequence is the E. coli tac pro- 
moter. 

20 7. A host organism transformed with a recombinant DNA molecule according to any one of claims 1 to 6. 

8. A host according to claim 7, which is E. coli transformed with an expression vector according to any one of claims 
4 to 6, and capable of producing prolylendopeptidase. 

25 9. A process for the production of a recombinant DNA encoding the Fiavobacterium meningosepticum prolylen- 
dopeptidase shown in SEQ ID No. 1, comprising the steps of: 



preparing cDNA or genomic DNA coding for prolylendopeptidase from F. meningosepticum cells or preparing 
DNA coding for the F meningosepticum prolylendopeptidase shown in SEQ ID No. 1 by chemical synthesis, 
30 inserting the DNA into a cloning vector, and selecting a hybrid vector containing DNA coding for prolylen- 

dopeptidase. 

10. A process for production of prolylendopeptidase comprising the steps of: culturing a host organism transformed 
with an expression vector according to any one of claims 4 to 6 and, optionally, recovering the produced prolylen- 

35 dopeptidase. 

11. A process according to claim 10, wherein the host organism is E. coli . 

12. A signal sequence having the amino acid sequence of the signal sequence shown in SEQ ID No. 1. 

40 

Claims for the following Contracting States : ES, GR 

1. A process for the production of a recombinant DNA encoding the Fiavobacterium meningosepticum prolylen- 
dopeptidase shown in SEQ ID No. 1, comprising the steps of: 

45 

preparing cDNA or genomic DNA coding for prolylendopeptidase from F meningosepticum cells or preparing 
DNA coding for the F meningosepticum prolylendopeptidase shown in SEQ ID No. 1 by chemical synthesis, 
inserting the DNA into a cloning vector, and selecting a hybrid vector containing DNA coding for prolylen- 
dopeptidase. 

50 

2. A process according to claim 1 for the preparation of a recombinant DNA molecule comprising the coding region 
shown in SEQ ID No. 1 . 



3. A process according to claim 1 for the preparation of a recombinant DNA molecule which is plasmid pFPEP03 
55 (FERM BP-3466). 

4. A process according to claim 1 for the preparation of a recombinant DNA molecule which is an expression vector 
comprising a DNA sequence coding for prolylendopeptidase operably linked with an expression control sequence. 
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5. A process for the preparation of recombinant DNA molecule according to claim 4 wherein the DNA sequence 
encoding prolylendopeptidase operably linked to an expression control sequence is the nucleic acid sequence 
shown in SEQ ID No. 1, extending 120bp upstream of the translation initiation site and to the BamHI site down- 
stream of the stop codon 

5 

6. A process according to claim 4 for the preparation of a recombinant DNA molecule wherein the expression con- 
trol sequence is the E. coli tac promoter. 

7. A process for the preparation of a transformed host comprising transforming a host organism with a recombinant 
10 DNA molecule as defined in any one of claims 1 to 6. 

8. A process according to claim 7 in which E. coN is transformed with an expression vector as defined in any one of 
clams 4 to 6. 

15 9. A process for production of prolylendopeptidase comprising the steps of: culturing a host organism transformed 
with an expression vector as defined in any one of claims 4 to 6 and, optionally, recovering the produced prolylen- 
dopeptidase. 

10. A process according to claim 9, wherein the host organism is E. coli . 

20 

1 1 . A process for the production of a C-terminal amidated peptide from two precursors thereof, wherein one of the 
precursors is a precursor peptide forming N-terminal region of the C-terminal amidated peptide and having a pro- 
line residue at its C-ter minus and another precursor is a precursor peptide or amino acid forming a C-terminal por- 
tion of the C-terminal amidated peptide which precursor peptide or amino acid has been C-terminal amidated, 

25 comprising the steps of: 

placing the two precursors in contact with F meningosepticum prolylendopeptidase having the sequence 
shown in SEQ ID No. 1 in a medium to convert the precursor peptides to the C-terminal amidated peptide, and 
recovering the resulting C-terminal amidated peptide. 

30 

12. A process for the expression of recombinant protein comprising using a signal sequence having the amino acid 
sequence of the signal sequence shown in SEQ ID No. 1 . 
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